Migemo: Incremental Search Method for Languages with Many Character Faces
Abstract
We introduce a new incremental search method called Migemo for languages with many character faces. Migemo performs the incremental search by dynamically expanding the input pattern into a compact regular expression which represents all the possible words that match the input pattern. We show that Migemo is useful not only for searching texts in Japanese and other East Asian languages, but also for performing sophisticated searches on ASCII-only documents.
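The expansion step described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration, not the authors' implementation: the toy dictionary `TOY_DICT`, the function names `expand_pattern` and `incremental_search`, and the use of a plain alternation rather than the compact regular expression the abstract mentions.

```python
import re

# Hypothetical toy dictionary: romanized reading -> written forms.
# A real system would use a full dictionary; this one is illustrative only.
TOY_DICT = {
    "kanji": ["漢字", "感じ"],
    "kana": ["仮名"],
    "kensaku": ["検索"],
}


def expand_pattern(prefix, dictionary=TOY_DICT):
    """Expand a typed prefix into a regex alternation.

    The result matches the literal prefix itself or any written form
    whose reading starts with that prefix. No compaction is attempted.
    """
    candidates = {prefix}
    for reading, words in dictionary.items():
        if reading.startswith(prefix):
            candidates.update(words)
    # Longest alternatives first, then lexicographic order for determinism.
    ordered = sorted(candidates, key=lambda w: (-len(w), w))
    return "|".join(re.escape(w) for w in ordered)


def incremental_search(text, prefix, dictionary=TOY_DICT):
    """Return the first substring of text matching the expanded prefix, or None."""
    match = re.search(expand_pattern(prefix, dictionary), text)
    return match.group(0) if match else None


if __name__ == "__main__":
    text = "この文書で漢字を検索する"
    # Re-run the search after each keystroke, as an incremental search would.
    for typed in ("k", "ka", "kan", "ke"):
        print(typed, "->", incremental_search(text, typed))
```

Rebuilding the pattern on every keystroke keeps the search incremental: the user types ASCII only, and the candidate set narrows as the prefix grows.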
Similar resources
Doctor's Thesis: Synthetic Assistance for Creation and Communication of Information
As the Internet has become popular, the circulation of information has increased rapidly, and many researchers have actively studied how to create and share information effectively. Creation and sharing of information can be seen as a continuous process of 1) how to find necessary information, 2) how to arrange the information, and 3) how to share the information with others. We consider that search and co...
SriShell Primo: A Predictive Sinhala Text Input System
Sinhala, spoken in Sri Lanka as an official language, is one of the less privileged languages; still there are no established text input methods. As with many of the Asian languages, Sinhala also has a large set of characters, forcing us to develop an input method that involves a conversion process from a key sequence to a character/word. This paper proposes a novel word-based predictive text i...
A Hybrid Meta-Heuristic Method to Optimize Bi-Objective Single Period Newsboy Problem with Fuzzy Cost and Incremental Discount
In this paper the real-world occurrence of the multiple-product multiple-constraint single period newsboy problem with two objectives, in which there is incremental discounts on the purchasing prices, is investigated. The constraints are the warehouse capacity and the batch forms of the order placements. The first objective of this problem is to find the order quantities such that the expected ...
Substring-based unsupervised transliteration with phonetic and contextual knowledge
We propose an unsupervised approach for substring-based transliteration which incorporates two new sources of knowledge in the learning process: (i) context by learning substring mappings, as opposed to single character mappings, and (ii) phonetic features which capture cross-lingual character similarity via prior distributions. Our approach is a two-stage iterative, boot-strapping solution, wh...
Arabic Hand Written Character Recognition Using Modified Multi-Neural Network
Hand written recognition is an interesting area of current artificial intelligence and advanced computing’s researchers. The complexity of the language controls the ability and the challenge of recognition its characters, whereas this complexity and uncertainty becomes multiplied. The use of Latin languages like English, or Spanish, limits the uncertainty because of the limited structure of the...
Publication date: 2001